首页> 外文OA文献 >Improving Collaborative Filtering based Recommenders using Topic Modelling
【2h】

Improving Collaborative Filtering based Recommenders using Topic Modelling

机译:使用主题改进基于协作过滤的推荐人   造型

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Standard Collaborative Filtering (CF) algorithms make use of interactionsbetween users and items in the form of implicit or explicit ratings alone forgenerating recommendations. Similarity among users or items is calculatedpurely based on rating overlap in this case,without considering explicitproperties of users or items involved, limiting their applicability in domainswith very sparse rating spaces. In many domains such as movies, news orelectronic commerce recommenders, considerable contextual data in text formdescribing item properties is available along with the rating data, which couldbe utilized to improve recommendation quality.In this paper, we propose a novelapproach to improve standard CF based recommenders by utilizing latentDirichlet allocation (LDA) to learn latent properties of items, expressed interms of topic proportions, derived from their textual description. We inferuser's topic preferences or persona in the same latent space,based on herhistorical ratings. While computing similarity between users, we make use of acombined similarity measure involving rating overlap as well as similarity inthe latent topic space. This approach alleviates sparsity problem as it allowscalculation of similarity between users even if they have not rated any itemsin common. Our experiments on multiple public datasets indicate that theproposed hybrid approach significantly outperforms standard user Based and itemBased CF recommenders in terms of classification accuracy metrics such asprecision, recall and f-measure.
机译:标准协作过滤(CF)算法仅以隐式或显式评级的形式利用用户和项目之间的交互来生成建议。在这种情况下,用户或项目之间的相似性完全基于评级重叠来计算,而没有考虑所涉及的用户或项目的显式属性,从而限制了它们在具有非常稀疏的评级空间的领域中的适用性。在电影,新闻或电子商务推荐者等许多领域中,可提供大量描述文字属性的上下文数据以及评分数据,这些数据可用于提高推荐质量。本文提出了一种新颖的方法来改进基于CF的标准推荐者通过使用latentDirichlet分配(LDA)来学习从文本描述中得出的主题的潜在属性(以主题比例表示)。基于历史评分,我们推断用户在相同潜在空间中的主题偏好或角色。在计算用户之间的相似度时,我们使用了包含评分重叠以及潜在主题空间中相似度的组合相似度度量。这种方法减轻了稀疏性问题,因为它可以计算用户之间的相似度,即使他们没有对任何共同的项目评分。我们在多个公共数据集上的实验表明,在分类精度指标(例如精度,召回率和f度量)方面,建议的混合方法明显优于标准的基于用户和基于项目的CF推荐器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号